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Abstract 

The basics of set theory are usually copied, directly or indirectly, 
by computer scientists from introductions to mathematical texts. Often 
mathematicians are content with special cases when the general case is 
of no mathematical interest. But sometimes what is of no mathematical 
interest is of great practical interest in computer science. For example, 
non-binary relations in mathematics tend to have numerical indexes and 
tend to be unsorted. In the theory and practice of relational databases 
both these simplifications are unwarranted. In response to this situation 
we present here an alternative to the "set-theoretic preliminaries" usually 
found in computer science texts. This paper separates binary relations 
from the kind of relations that are needed in relational databases. Its 
treatment of functions supports both computer science in general and the 
kind of relations needed in databases. As a sample application this paper 
shows how the mathematical theory of relations naturally leads to the re- 
lational data model and how the operations on relations are by themselves 
already a powerful vehicle for queries. 



1 Introduction 

Mathematics is more useful than computer scientists tend to think. I am not 
talking about specialized areas of computer science with strongly developed 
mathematical theories, like graph theory in computer networks or posets and 
lattices for programming language semantics. These are solid extensions of the 
traditional notion of Applied Mathematics. 

No, I'm talking about the mundane mathematics that falls into the cracks 
between these well-developed areas: basic facts about sets, functions, and rela- 
tions. I refer to the material that gets relegated to the desultory miscellanies 
added to texts under the name of "set-theoretic preliminaries" . The reason for 
the perfunctory way in which these are compiled is that computer scientists are 
ambivalent about basic set theory: they don't care for that stuff, yet dare not 
omit it. 
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My own lack of understanding in this area resulted in difficulties with the 
relational model. Accordingly, I include it as a case study of how basic set theory 
could have clarified these difficulties, which are also found in the literature. 

The ambivalence concerning the set-theoretic preliminaries is understand- 
able. Is a proper treatment not going to lead to an unacceptably large mass 
of definitions and theorems? After all, set theory is nothing if it is not done 
rigorously. In other parts of mathematics you can start right away with what 
interests you and you refer to something else for the foundation. No such help 
is available when you deal with the foundation itself. 

This paper is intended to improve this situation. My model is Halmos's 
Naive Set Theory 0], which shows that one can be precise enough and yet 
easily digestible without losing anything essential. Halmos's book was aimed at 
normal mathematicians; that is, those who have no interest in set theory, yet 
cannot do without it. It is therefore ideal for computer scientists, who are in 
the same position. Though set theory is no different for mathematicians than it 
is for computer scientists, it does make sense to rewrite the first part of @] into 
a sort of Halmos for computer scientists. This is what I am trying to do here. 

Halmos had to maintain a delicate balance between confficting requirements. 
On the one hand he wanted to avoid an arid listing of the facts and definitions 
that normal mathematicians need. Such a bare listing would not do justice to 
the intrinsic interest of the subject. On the other hand, he did not want to get 
sucked into the depth, and richness, of the subject. I have tried to follow his 
example. On the one hand, I give more than a minimal listing of facts. For 
example, I even start with a history of set theory. To maintain the balance, I 
kept it to less than five hundred words. In the sequel I have tried to continue 
this balance. 

These considerations have resulted in a paper that is difficult to classify. 
It is part review, part tutorial. As a result of approaching certain subjects 
important for computer scientists, like relations, in the way that mathematicians 
had learned to do by the middle of the 20th century, certain new and useful 
results come out, so that this paper also contains recent research. But most of 
all, the purpose is methodological: to show that, before breaking new ground, 
we should go back to the basic abstract mathematics that became the consensus 
of mathematicians in the 1930s and was codified by Bourbaki in the Fascicule 
de Resultats ^ of their Theorie des Ensembles [2]. 

2 Sets 

The development of the calculus in the 18th century was a spectacular practical 
success. At the same time, philosophers poked fun at the mathematicians for 
the peculiar logic used in reasoning about infinitesimals (sometimes they were 
treated as zero; sometimes not). In the 19th century, starting with Cauchy, 
analysis developed as the theory underlying the calculus. The goal was to do 
away with infinitesimals and explain the processes of analysis in terms of limits. 
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rationals, and, ultimately, integers as the bedrock foundation^. 

In the late 19th century Cantor was led to create set theory in response to 
further problems in analysis. This theory promised to be an even deeper layer 
of bedrock in terms of which even the integers, including hierarchies of infinities, 
could be explained^. 

For some decades set theory was an esoteric and controversial theory, plagued 
by paradoxes. The axiomatic treatment of Zermelo and Fraenkel made set 
theory into respectable mathematics. A sure sign of its newly achieved status 
was that Bourbaki started their great multivolume treatise on analysis with a 
summary of set theory P] to serve as foundation of the entire edifice. 

When set theory is not just viewed as one of the branches of mathematics, 
but as the foundation of all mathematics, it is tempting to conclude that the 
proper, systematic way to teach any kind of mathematics is to start with set 
theory. Combine this idea with the curious notion that children are in school not 
just to learn to reckon, but to do "Math" , and you get a movement like "New 
Math" according to which children are taught sets before they get to counting. 

By the time the disastrous results of this approach were recognized, set 
theory was out of favour. Even pure mathematicians were no longer impressed 
by the edifice erected by Bourbaki. Set theory was relegated to the status of 
one of the many branches of mathematics; one distinguished by the utter lack 
of applications. Mathematicians interested in foundations of mathematics and 
theoretically minded computer scientists use categories rather than set theory. 
A characteristic quote from this part of the world is: "Set theory is the biggest 
mistake in mathematics since Roman numerals." 

For the kind of things that these people do, category theory may well be 
superior. What I hope to show here is that a bit of basic old-fashioned set 
theory goes a long way in improving what the rest of us do. 

2.1 Set membership and set inclusion 

What is a set? 

I shall remain evasive on this point. In geometry, there are definitions of 
things like triangles and parallelograms, but not of points and lines. The latter 
are undefined primitive objects in terms of which other geometrical figures are 
defined. All we are allowed to assume about points and lines is what the axioms 
say about the relations between them (like there being one and only one line on 
which two given points lie). 

In the same way it is not appropriate to ask "What is a set?" . All we know 
about sets is that they enter into a certain relation with another set or with an 
element and that these relations have certain properties. Some of these I review 
here without attempting completeness. 

^Leopold Kronecker (1823-1891) is widely quoted as having uttered: "Die ganze Zahl schuf 
der liebe Gott, alles Uebrige ist Menschenwerk." (God made the integers, all else is the work 
of man). 

^Professor Kronecker did not take kindly to these new developments and intervened to 
thwart Cantor's career advancement. 
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If S' is a set, then x S means that x is an element (or member) of S. The 
symbol G stands for the m,embership relation. If S and T are sets, then S CT 
means that every element of S (if any) is an element of T (if any). We say that 
S* is a subset of T and that T is a superset of S. Thus for every set S, we have 
S C S. Many authors use "C" to indicate the subset-superset relation. 

If we have both S C T and T c S, then S and T have the same elements 
and we write S = T. An important property of sets is that if S and T have 
the same elements, then S is the same set as T. That is, a set is completely 
determined by the elements it contains. A set is not one of those wholes that 
are more than the sum of their parts. 

To specify a set we only need to tell what elements it has. We can do this in 
two ways: element by element or by giving a rule for membership. The notation 
for the first way is to list the elements between braces, as in 

5 = {*,p,t,5,f}, 

which says that the elements of S are p, f, 5, and f , and that there are no 
other elements. As a set is determined by its elements, the order in the listing 
is immaterial. 

The other specification method is by a rule that determines the elements of 
a set; for example 

{x \ X G J\f and x is not divisible by 2} 

for the set of odd numbers, where Af is the set of natural numbers. In this style 
p, t, 5, ^} can be specified as 

{a;|a; = <|kora; = pora; = torx = 5ora; = ^}. 

The set with no elements is called the empty set or the null set, which we 
write as {} or as 0. The empty set is a subset of every set^. 

Set operations If S and T are sets, then 

{x\x € S OT X €T} 
{x\x € S and x gT} 
{x\x € S and x ^T} 

are also sets. SUT is known as the union of S and T, SdT as the intersection, 
and S* \ T as the set difference. The expression x occurring in the definition 
oi S \ T means that x is not a member of T. 

One can characterize S UT and S (iT, respectively, as the least common 
superset and greatest common subset of S and T. 

^It is even a subset of the empty set. 
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Sets of sets Elements of a set can themselves be sets. For every set S we 
define the powerset ViS) of 5* as the set of all subsets of 5*. For example, 



P{{a,b,c}) 



{{}, {a}, {b}, {c}, {a, b}, {a, c}, {b, c}, {a, b, c 
{{}} 

{{},m} 



■}} 



Let S be a nonempty set of sets. Then US' is defined as {x \ 3S' G S .x £ S'} 
and nS as {x\yS' G S .xG S'}. 

Let 5* be a set of sets and let 5*0 and 5*1 be non-empty subsets of S. Then 
So C Si implies USq C USi. It also implies that nSi C DSq. 

The naive point of view What makes sets interesting is that their elements 
can be sets. Then of course one also has sets that contain sets that contain 
sets, and so on. It is easy to get carried away and consider things like {x \ x ^ 
x}. To assume that such a thing is a set leads to contradiction. As a result, 
mathematicians have learned to be careful. This care takes the form of axioms 
that explicitly state what things are allowed to be sets. Nothing is a set unless 
these axioms say it is. This is axiomatic set theory. 

In this paper I do not try to justify the existence of the sets that we want 
to talk about. That is the "naive point of view". According to axiomatic set 
theory, no things exist except sets. It is naive to assume that a set such as 
p, t, 5, ^} exists. Axiomatic set theory requires one to justify the existence 
as sets of ^jt, p, f, 5, and ^. 

It is the hallmark of naivete to emphasize, as I often will do, that the elements 
of a set are sets: in axiomatic set theory, every nonempty set answers this 
description. 

As an example of familiar objects being reconstructed as sets, let us consider 
numbers. One of the things set theory is expected to do is to explain and to 
justify the various types of numbers. When starting with sets only, we cannot 
say that p, f , 5, %} has five elements, or even that this set has more elements 
than {4, p, 5, ^}. "Five" and "more" needs to be explained in terms of sets. We 
cannot even say that (or whether) a set has a finite number of elements without 
a set-theoretic explanation of finiteness. 

While remaining naive, we should still get some idea of how axiomatic set 
theory could justify the existence of the kind of sets that we work with. Con- 
sider the natural numbers 0, 1, 2, . . . Strictly speaking such numbers are either 
"ordinal" or "cardinal". Ordinal numbers refer to locations in a linear order; 
cardinal numbers refer to sizes of sets. The construction of ordinal numbers in 
terms of sets is simplest, so we'll just consider that. As the natural numbers 
are all finite anyway, the distinction between cardinal and ordinal numbers need 
not concern us. 

The axioms guarantee for every set S the existence of its unique successor 
set S~^, which is SU {S}. This at least permits us to say that S~^ has one more 
element than S, even if we don't know how to count the elements of S. 



5 



As the axioms also allow the existence of the empty set 0, we have also the 
sets 0"*", 0++, and so on, all the way up to infinity (and beyond). These are 
considered to be the ordinal numbers 0, 1, 2, and so on, respectively. Before 
set theory it was already shown how one could get from the set M of natural 
numbers successively the set I of integers, the set Q of rationals (that's simple), 
and, from the rationals, the set TZ of reals (not so simple). 

If S'+ = 5 U {5}, then S C S+ . As a result, C 0+ C 0++ C • ■ ■ In fact, 
one can see that = {}, 0+ = {0}, 0++ = {0, 0+}, . . . : Every ordinal number 
is the set of its predecessors. So we could just consider the natural numbers 
as finite ordinals and consider the natural number to be and every positive 
natural number n to be {0, . . . , n — 1}. Instead of "for all i = 0, . . . , n — 1" 
we could be rid of the ambiguous ". . ." and write "for all i G n" . Though this 
consequence of a natural number n being a set is not widely used, it is concise 
and unambiguous. 

2.2 Pcirtitions 

Let P be a set of non-empty subsets of a set S. Then we have UP C S. If we 
also have UP = S, then P is a cover for S. The elements of P are called the 
cells of the cover. If a cover P for S is such that the sets of P arc mutually 
disjoint, then P is a partition of S. 

A partition P' of 5* is said to be finer than P if, for every cell C of P', there 
is a cell C of P such that C C C. This makes every partition finer than itself, 
which is fine (in mathematics). It makes {{x} \ x G S} the finest partition of 
S, and {{5'}} the coarsest. 

2.3 Pairs 

The order in which we list the elements of a {a, b} is an artifact of the notation. 
The equality of {a, b} and {b, a} can perhaps be seen most clearly by writing 
both as {a; I a; = a V a: = 6}. 

How does one use sets not only to bundle a and b together but also to place 

them in a certain order? The usual way to do this is based on the observation 
that, though {a,b} and {b,a} are the same set, {{a}, {a, 6}} and {{6}, {a, 5}} 
do not have the same elements, so are not the same set. We call {{a}, {a, b}} a 
pair, and write (a, 6). The left element of (a, 6) is a; its right element is b. 

In a similar way one can define triples {a,b,c) as ((a, fe),c) or (a, (6, c)) 
quadruples {a,b,c,d), and so on. 

3 Binary relations 

A relation among sets associates some of the elements in each of the sets with 
each other. In this section, I present only binary relations: relations between 
two sets or between a set and itself. Relations between any number of sets are 
treated later, independently of binary relations. 
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Everything so far in this paper is standard. However, from now on authors 
differ on some of the terminology and definitions, though the underlying con- 
cepts are universally accepted. Hence the occasional Definition from this point 
onward. 

Definition 1 (binary relation, source, target, extent, ^) A binary rela- 
tion is a triple of the form {S, T, E) , where S and T are sets and E is a set of 
pairs. If E is nonempty, then {x, y) G E implies x ^ S and y G T. S is the 
source, T is the target, and E is the extent of {S,T,E). The set of all binary 
relations with source S and target T is denoted by the expression S' T. 

Definition 2 (binary Cartesian product) The binary Cartesian product of 
sets S and T is defined to be {{x,y) \ x £ S Ay £T}. It is written as S xT. 

Definition 3 (empty binary relation, universal binary relation) The empty 
binary relation in S ^ T is the one of which the extent is the empty set of pairs. 
The binary relation in S T that has S x T as extent is the universal binary 
relation, denoted U{S,T). 

3.1 Set-like operations and comparisons 

Among the binary relations in 5 T certain operations are defined that mirror 
set operations: 

{S,T,Eo)U{S,T,Ei) =^ {S,T,EoUEi) 
{S,T,Eo)n{S,T,Ei) {S,T,EonEi) 
{S,T,Eo)\{S,T,E,) =^ {S,T,Eo\E,) 

Comparison of sets carries over in a similar way to binary relations: 

{S,T,Eo)c{S,T,Ei) iff EoCEi 

3.2 Properties of binary relations 

Consider a binary relation {S, T, E). It has any subset of the following proper- 
ties: 

• total: for every x G S there exists y gT such that (a;, y) G E. 

• single-valued: for all x G S, yo G T, and yi G T, {x,yo) G E and 
{x,yx) G E imply yo = Vi- 

• surjective: for every y gT there exists x G S such that (a;, y) G E. 

• injective: for all y G T, xo G S, and xi G S, {xo,y) G E and {xi,y) G E 
imply xo=Xi. 
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Definition 4 A partial function is defined to be a single-valued binary relation. 
A functional binary relation is a partial function that is total. 

We reserve "function" for the similar concept that will be defined later in- 
dependently of binary relations. Some authors, however, prefer to regard a 
function as the special case of a binary relation that is single-valued and total. 

3.3 Other operations on binary relations 

In addition to the set-like operations defined in section 13.11 the following are 
defined for binary relations. 

The inverse of the binary relation (S*, T, E) is given by 

{S,T,E)-' = {T,S,{{y,x) \ {x,y)eE}). 

For every binary relation r we have that {r^^)^^ ~ r. 

It is apparent from this definition that every binary relation has an inverse: 
no properties are needed. For example, every partial function / has an inverse 
/~^, when regarded as binary relation. But /"^ is not necessarily a partial 
function: for that to be true, / needs to be injective. 

Definition 5 (composition) The composition ";" of two binary relations is 
defined when the target of the first is the source of the second. In that case it is 
defined by 

{S, T, E); (T, U, F) {S, U, {{x, z) \3y e T.{x, y) e E A {y, z) e F}> 
If To; ri is defined, then so is r^^; r^^, and it equals (ro; ri)^^. 

3.4 Endo-relations 

A binary relation where source and target are the same set may well be called 
an endo-relation. Some of the most commonly used binary relations are of this 
kind. 

def 

For every set S we define the identity on S as idg = {S, S, {{x,x) | x e S}). 
The identity relations allow, in combination with composition, succinct char- 
acterizations of various properties that a relation r in S* ^ T may have: 

single- valued iff r^^;r cidr 

surjective iff r~^;r Didr 

injective iff r; C idg 

total iff r; r^^ D idg 
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Equivalence relations Let us consider the endo-relation r = {S, S, E). 

If ids C r, then r is said to be reflexive. 
If r — r^^ , then r is said to be symmetric. 
If r; r C r, then r is said to be transitive. 

An endo-relation that has these three properties is caUed an equivalence. 

Consider {{y \ {x, y) G E} \ x G S}. If r is reflexive, then this is a cover for 
S. If r is an equivalence, then it is a partition. 

Some order relations If we drop from equivalence the symmetry require- 
ment, then we are left with a weak type of order called a pre- order. Even the 
inclusion relation among the subsets of a set is stronger: in addition to reflexiv- 
ity and transitivity, it has the property of also being antisymmetric, which can 
be defined as r n C ids. An antisymmetric pre-order is a partial order. 

Partial orders are called thus because not every pair of elements has to be 
in the relation. A partial order r such that r U r^^ — U{S,S), the universal 
relation on S, is said to be order-total. The qualification "order-" serves to avoid 
confusion with the property of being total that was defined earlier. A partial 
order that is also order-total is called a total order, or a linear order. 

4 Functions 

The following definition is independent of the definition of the functional binary 
relation in Definition 0J 

Definition 6 (function, source, target, map, — >) A function consists of a 
set S, its source, a set T, its target, and a map, which associates to every 
element of S a unique element ofT. We write S T for the set of all functions 
with S as source and T as target. 

S is often called the "domain" of the function. However, I prefer to reserve 
"domain" for the different meanings that are entrenched in subspecializations 
of computer science such as programming language semantics, constraint pro- 
cessing, and databases. The arrow notation suggests "source" as the needed 
alternative for "domain"; "target" is its natural counterpart. 

The set S" — > T is said to be the type of f £ S ^ T. One often sees 
"/ : T" when / e ^ T is meant. 

S and T are typically nonempty, but need not be. S and T may or may not 
be the same set. 

If the map of / associates the element y in T with the element x in S, then 
we call y the value of the function at the argument x. We may also say that 
/ maps X to y. We write y as f{x) to emphasize that y is determined the 
argument x. 

Example 1 Given f G S T , we can define the binary relation {S, T, {{x, f{x)) \ 
X e S}) . It is a functional binary relation. 
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Given a functional binary relation {S,T,E), we can define a function in 
S T whose map associates x with the unique y such that {x,y) G E, for every 
xeS. 

The arrow notation 5 — > T makes it easy to remember which set is the source 
and which is the target. S* — > T is sometimes written as . This notation 
makes it easier to confuse source and target. Here is a way to remember which 
is which. Let us define, for some sets at least, \S\ as the number of elements of 
S. Then it can be seen that \S ^ T\ = iTjl'^l. 

This formula is remarkably resistant to perverse special cases of which we 
now consider a few. Consider {x} —^ T, where the source is a one-element set. 
As any such function has to associate one element of T with x, T cannot be 
empty. It is easy to count the functions of this type: there are as many such 
functions as there are elements in T. The set ^ T is reckoned to have one 
function in it: there is only one way of associating an element of the target with 
each element of the source if there are no elements in the source. 

4.1 Multiple arguments 

Functions have one argument. But source and target can themselves have struc- 
ture and this can be arranged in such a way as to suggest multiple arguments. 

Consider / e S'o ^ (^i T). With / of this type, xq £ Sq implies that 
f{xo) G Si ^ T and xi G 5*1 implies that f{xo), if given argument xi, yields as 
value (/(a;o))(xi) eT. 

Instead of (/(a;o))(a^i) we will write f{xo,xi). Similarly, we will regard 
f{xo, ■ . ■ , Xn-i) as shorthand for the value of a function in 

S„^{Si^ > {Sn-l ^ T) • • •) 

at arguments xq G So, ■ ■ ■ ,Xn-i G Sn-i- This construction is called "Currying" 
after the logician Haskell B. Curry. 

Example 2 Consider g{x) — y with g (z U V . Here y is determined by 
two things: g and x. Hence there is a function, say a, that has value y for 
arguments g and x. This function is in {U V) ^ {U ^ V). Compare this 
with two-argument Currying as in f €z Sq ~* {Si T) and let Sq correspond 
to (U — > V) and Si to U . Then we see that a is a two argument function with 
g G U V as first argument and x d U as second argument. In functional 
programming, this is the function called APPLY. 

4.2 Expressions 

Definition specifies a map as being part of a function. It does not specify how 
this map associates source elements with target elements. One of the ways in 
which this can be done is by an expression E typically containing x if the value 
of E is defined for every value of a G S' substituted for x and if the values of E 
are in T. We can then find y as the result of evaluating E. 
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The expression can be a program or it can be an arithmetic or symbohc 
expression. To indicate that the map associates a with E[x/a]^ one writes 
E. 

E by itself does not specify a function, as such a specification requires the 
source and the target. E may not even be sufficient as specification of the map. 
For example, when E is x'^+y, then x ^ E and y ^ E are the maps of different 
functions. If there is an expression that specifies the map of a function, then 
there are typically other ones as well. For example, if x G 7?,, then one may 
prefer (a; + 1)^ to a;^ + 2a; + 1. A reason could be that one prefers to avoid 
multiple occiirrcnccs of a variable. 

The notation x ^ E is, similar Xx.E, as in lambda calculus. The arrow 
notation has the advantage of requiring one symbol less than Xx.E. Moreover, 
when there are two arguments, as wc have here with x and E, infix is the most 
convenient notation (compare "2 + 2" with "+2.2"). 

Example 3 f M N with m,ap n ^ {{I + ^5)" - (1 - V5)")/(2"y5; is 
the function that gives the n-th Fibonacci number. 

4.3 Properties of functions 

Note the asymmetry in the definition of a function / S 5 — > T: with every 
X & S the map of the function associates a unique y & T. 

The function f G S ^ T may associate more than one x G S with the same 
y GT. If, on the other hand, f{xi) = ,f{x2) implies that xi = X2. then this is a 
property that not all functions have. A function that has this property is said 
to be injective. 

It is not necessarily the case that civery y G T is equal to f{x) for some 
X G S. If / has this additional property, then it is said to be surjective. 

To be able to count elements in a set we have to define a suitable notion of 
number. It would be nice and tidy if this notion of number of elements in a set 
were based on sets. As we have not done this, we cannot officially count the 
elements in a set. But with injectivity and surjectivity we can at least compare 
sizes of sets: if / € S ^ T is surjective, then there have to be at least as many 
elements in S as there are in T; if / G S* ^ T is injective, then there have to 
be at least as many elements in T as there are in S. If / is both injective and 
surjective (such a function is said to be bijective), then S and T have the same 
number of elements, however "number" is defined. 

If / e 5 ^ T is a bijection, then there exists a function g in T ^ S that 
associates with every ?y G T the unique x G S such that f{x) = y. This g is 
uniquely determined by /, is written f~^, and is called the inverse of /. It is a 
bijection; hence has an inverse {f~^)~^, which is equal to /. 

If there exists a bijection in n ^ S, for some natural number n, then wc 
can define S to have n elements. In that case, S is said to be finite, otherwise 
infinite. S is countably infinite if there exists a bijection in A/^ — > 5. If two sets 
are infinite, then it is not necessarily the case that there is a bijection between 
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them. For example, there is no bijection in Q — > 7?., where Q and TZ are the 
sets of rational and real numbers, respectively. 

Definition 7 (identity, restriction, insertion) The function in S ^ S with 
map X X is ids, the identity function on 5. (I count on the context to prevent 
confusion with the identity that is a binary relation. ) 

Let f be a function in S ^ T and let S' be a set. The restriction of f to S' 
is written f I S' and is defined'^ as the function in S (1 S' ^ T that has the 
mapping that associates f{x) € T with every x (if any) in S H S' . 
If S' C S, then ids I S' is called the insertion function determined by the subset 
S' ofS. 

Another function solely determined by subsets is the characteristic function for 
S", which is in S" ^ 2 and maps a; in 5 to 1 if a; £ S' and maps x to otherwise. 

4.4 Function sum 

Definition 8 (function sum) Functions fo G Sq Tq and /i G 5*1 — > Ti are 

summable iff for all a; e 5*0 n S*! , if any, we have fo (x) = fi {x) . If this is the 
case, then /o + /i, the sum o//o and fi, is the function in (SqUSi) — > (TgUTi) 
with map x i-^ faix) if x ^ So and x i-^ fiix) if x G Si. 

Example 4 Let fo e {a,b} {0,1} such that a i-^ and b i-^ 1. Let fi G 
{b,c} — > {0, 1} such that & i—s- 1 and c i— > 0. Then fo and fi are summable and 
(/o + /i) G c} — > {0, 1} such that a i— s- 0, & i— > 1, and c i— s- 0. 

Remarks about summability: Given /o G So — > Tq and fi G Si ^ Ti. fo i Si 
and fo are summable, as are fo I Si and fo I (So \ Si) and /o = /o + (/o i 
^i) = (/o i Si) + {fo i {So \ Si). 

If fo and fi are summable, then 

• /o + .fi = .fo + .fi I {Si \ So) = fi + fo i {So \ Si) 

• if, moreover. So H 5*1 is nonempty, then Tq n Ti is. 

4.5 Function composition 

Definition 9 (composition) Let / G S* — > T and g G T ^ U . The composi- 
tion g o f of f and g is the function in S U that has as map x i— > g{f{x)). 

See Figure We sometimes write g o f without explicitly saying that it is 
defined; that is, that the target of / is the source of g. 

Example 5 For all f G S T we have f o 1^5 — f idr o /. Let S' be a 
subset of S and let i be the insertion function in S' S. Then f [ S' — f o i. 
See Figure^ 

*Some authors restrict the notion of restriction to the case where S' C S. 
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Figure 1: Top left: f e S ^ T. Top right: h = go f. Bottom left: S' C S 
and z G S" — > 5 is the insertion ftmction of S' as a subset. The diagram 
shows ,f' = ,f i S' = f o i. Bottom right: gf = gof^hg~hog, and 
hgf ^ ho {g o f) ^ {ho g) o f. 

The order of / and g in g o / is derived from the expression g{f{x)) in the 
definition of composition. In many situations this order seems unnatural. If 
the / and g were functional binary relations, then their composition would be 
written as /; g. 

Function composition is associative: if, in addition to Definition |3 we have 
h £ U —^V, then {h o g) o f and h o [g o f) are the same function in S V . 
See Figure n 

A useful property of composition is that if g o / is injective, then / is. For 
suppose that / is not injective. Then there exist xi and X2 in S such that 
xi X2 and f{xi) = f{x2)- Then we also have that g{f{xi)) = g{f{x2)), so 
that g o f would not be injective. 

The dual counterpart of this property is that if g o f is surjective, then g is. 
For suppose that g is not surjective. Then there is a z in [/ such that there is 
no y in T with g(y) = z. Then there is also no x in S with g{f{x)) = z. 

Left and right inverse Let S and T be nonempty sets, let / £ S" ^ T, and 
let g £ T S. If ids — 9 ° then we say that / is a right inverse of g and 
that 5 is a left inverse of /. In this case, / is injective and g is surjective. 

To see why, note that ids is injective and surjective. Hence, by the property 
of composition just introduced, ids — 9 ° f implies that / is injective and that 
g is surjective. 

If, in that case, / is not surjective, then there exists a y in T such that there 
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^ __ ^--^ 

g ■ - 9 

Figure 2: On the left: go f = ids. For this, / needs to be injective, but need 
not be surjcctive. As it is not, g is not uniquely determined: g{l) could be a 
and we would still have g o f = idg. On the right: f o g = idr- For this, g 
needs to be injective, but need not be surjective. As it is not, / is not uniquely 
determined: f{b) could be m and we would still have f o g = idy- 

does not exist an a; in 5 such that f{x) = y. For such a y, the value of g does 
not affect go f. Hence, / and idg = go f do not in general uniquely determine 
9- 

But ids =9of and / surjective do determine g. In that case / is a bijection, 
so that the inverse exists. The unique g = is a bijection as well, so 
that g is also injective. 

Conversely, injectivity of f £ S ^ T implies the existence of a left inverse 
g gT ^ S such that g o f = ids, which is then surjective. 

4.6 Set extensions of functions 

Whenever we have a function type 5 — > T, it is natural to consider the type 
P{S) V{T). Let / be in 5 ^ T. We call a set extension of / any function 
g e V{S) V{T) such that for all subsets S' of S we have that {/(s) | s G 
S'} C g{S'). 

This definition suggests a partial order among the set extensions of a given 
/ e 5 — > T. We define go ^ gi for set extensions go and gi of / iff go{S') C 
gi{S') for all subsets S' of S. The relation ^ is a partial order. In this partial 
order there is a least element. Its map is S' i-^ {f{x) \ x G S'} for all subsets S' 
of S. There is a greatest element in the partial order. Its map is 5' T for all 
subsets S' of S. 

The least set extension of / G 5 ^ T, the one that has as map S' i-^ {/(s) | 
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s £ S'}, is called the canonical set extension of /. Conversely, the function h in 
V{T) V{S) such that for all subsets T' of T 

h{T') ^{xeS\ fix) e T'} 

is the inverse set extension of /. Note that the inverse set extension is defined 
for any function, whether it is a bijection or not. 

Usually, the canonical set extension of / is just written as /, and the inverse 
set extension as /^^, so that we write f{S') — {f{x) \ x £ S'} and f^^{T') = 
{x £ S \ f(x) G T'}. Conventional notation relies on context to disambiguate 

, because it may mean the inverse of a bijection / or it may mean the inverse 
set extension of an / that is not necessarily a bijection. 

An important property that distinguishes some set extensions is their mono- 
tonicity: Si C 5*2 implies f{Si) C /(5'2)- Monotonicity gives us f{Si H S2) C 
f{Si) and f{Si n S2) C /(S'2), so that f{Si) and /(S'2) are both supersets of 
f {Si n S'2). As f{Si) n /(S'2) is the least common superset of /(Si) and /(S2), 
we have 

/(SinS2)c/(Si)n/(S2). 

But the reverse inclusion does not in general hold. For example, suppose that 
f{xi) = f{x2) = y and xi 7^ X2 and let us take Si = {xi} and S2 = {x2}- Then 
y £ f{{xi})r\f{{x2}), whereas y is not contained in f{{xi}r\{x2}) = /(0) = 0. 
Injectivity of / is a necessary and sufficient condition for equality. 
On the other hand, wc do have 

/(Si)U/(S2) = /(SiUS2) (1) 

That the left hand side is contained in the right hand side follows from mono- 
tonicity and union being the least common superset. For the reverse inclu- 
sion suppose that y G /(Si U S2). Then there is an a; g Si U S2 such that 
fix) e /(S1US2). If the x is in Si, then/(a;) is in /(Si); hence in /(Si)U/(S2). 
Otherwise, the x is in S2, hence fix) is in /(S2); hence in /(Si)U/(S2). Either 
way, y € /(Si) U /(S2). This shows that the right hand side of (Q) is included 
in the left hand side. 

For any / in S ^ T we have 

S' c /-i(/(S')) 

r D fif-\r)) 

with S' and T' arbitrary subsets of S and T. In the first case we have equality 
iff / is injective; in the second case we have equality iff / is surjective. 

4.7 Enriched sets 

S and T are the same set if and only if every element of S belongs to T and 
vice versa. Thus sets lack some properties, such as order, that are sometimes 
desirable in a collection. Moreover, set membership is all or nothing, not a 
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matter of degree. Many authors consider the set-theoretic concept of a set 
inadequate. This leads to various proposals for enrichment of the set concept. 

The method of set theory is to use functions to specify additional properties, 
not to enrich the concept of set itself. 

"Ordered sets" Suppose we wish to specify an order ^ among the elements 
of a finite set S. We can do this by defining a bijection f & n ^ S, where 
n G Af. For every xi G S and X2 & S there exist ii & n and Z2 G n such that 
/(ii) = xi and /(i2) = xi. Then we can define xi ^ X2 according to whether 
ii < «2- 

In set theory, S in combination with such an / takes the place of an "ordered 
set". 

"Multisets" Sometimes one considers a collection where it is not enough to 
say whether or not an element belongs, but one wants to say how many times 
it belongs. Such a collection is then called a multiset or bag. 

In set theory such a requirement poses no difficulty. It is a special case of the 
common situation that one wants to associate an attribute with each element of 
a set. Suppose we have a set 5* of objects and a set T of attributes. A function 
f £ S ^ T then specifies for each x G S which of the attributes f{x) e T it 
has. liT = M, then the attribute f{x) can be regarded as the multiplicity of x. 

In set theory, S in combination with such an / takes the place of a "multiset" . 

"Fuzzy sets" Sometimes one considers a collection where it is not enough to 
say whether or not an element belongs, but one wants to say how strongly it 
belongs. 

In set theory, one would regard the required strength of belonging as an 
attribute. The interval [0, 1] of real numbers can be selected as the set of at- 
tributes. A set S in combination with / € 5 ^ [0, 1] can then be regarded as a 

"fuzzy set" . 

5 Tuples 

Often a function in 5* ^ T models a process of which the input and output 
consist of elements of S and T, respectively. A function f G S ^ T may 
be used in a different way: as a method for indexing elements of T using the 
elements of S as index. 

When / is used in this way, then it is called a family or a tuple with S as 
index set, of which the components are restricted to belong to T. 

In such a situation one sometimes writes fs rather than /(s) for the unique 
element of T indexed hy s G S. Arrays in programming languages are examples 
of such families. Then S = n and one would write f[i] rather than f{i) where 
i G n. 
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If the index set is numerical, as in S' = n, for some n S TV, or S = TV, then 
a tuple in 5' ^ T is called a sequence. If 5 = TV, then the sequence is said to 
be infinite. If S* = n, then we call n the length of the sequence. 

If we think of T as an alphabet, then the functions in n — > T are the strings 
over T of length n. Suppose we have strings a € m T and P d n T. Then 
7 G (m + n) ^ T is the concatenation of a and /3 if the map of 7 is given by 
i I— !■ a{i) if < i < TO and i i— > (3{i — to) if to < i < to + n. 

5.1 Notation 

Parentheses in mathematics are overloaded with meanings. There is the prime 
purpose of indicating tree structure in expressions. An unrelated meaning is to 
indicate function application, as in /(x), although these two meanings seem to 
combine in f{x + y). And then we see that a sequence a; in n — > T is often 
written as (xq, . . . , x„-i). Let us at least get rid of this last variant by writing 
X G n ^ T a.s {xq, . . . , Xn-i)^. 

Tuples need not be indexed by integers. For example, points in the Euclidean 
plane are reals indexed by the X and Y coordinates. Describing points this way 
corresponds to regarding a point p as a tuple in {X, Y} TZ. For example, a 
point p could be such that px = 2 and py = 3. 

In a context where the points are thought of as being characterized by the 
coordinates X and Y one often sees p = (2,3). Apparently, a quick switch 
has been made from {X, Y} to {0, 1} as index set. We need a notation that 
unambiguously describes tuples with small nonnumerical index sets. 

Consider tuples in S* ^ T representing addresses. Here the index set S could 

be 

{number, street, city, state, zip}. 

Such tuples can be written as lines in a table with columns labeled by the 
elements of the index set. The ordering of the columns is immaterial. 

In this way addresses a, b, c, d, and e would be described as shown in 
Figure 121 





number 


street 


city 


state 


zip 


a 


19200 


120th Ave 


BotheU 


WA 


98011 


b, c 


2200 


Mission College Boulevard 


Santa Clara 


CA 


95052 




2201 


C Street NW 


Washington 


DC 


20520 


d 


36 


Cooper Square 


New York 


NY 


10003 


e 


2201 


C Street NW 


Washington 


DC 


20520 



Figure 3: A sample of named and unnamed tuples. 

In Figure 13 the area to the left of the double vertical line is not a column: 
it only serves to write the names, if any, of the tuples. The third tuple has 

potential pitfall of this notation for tuples is that (a, b) can now mean either a pair or 
a tuple p with 2 as index set such that po = a and pi = b. 
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no name; it is still a tuple. There is a tuple that has two names: b and c. 
Another tuple occurs twice in the table, once with its name shown; the other 
time without the name. 

Without a table we would have to specify laboriously a^T^ij^bQ]- = 19200, 
^street = 120th Ave, . . ., bj^^j^^^j. = 2200, ^street = Mission College Boulevard, 
. . ., and so on. Such notation is unavoidable if tables are not available. For ex- 
ample in XML^ the tuple a would be an "element" endowed with "attributes" : 

<a number = "19200" street = "120th Ave" city = "Bothell" 

state = "WA" zip = "98011" 

> 

If no tuple has a name, then one might as well omit the double vertical line, 
and describe the four tuples by means of the table: 



number 


street 


city 


state 


zip 


19200 


120th Ave 


Bothell 


WA 


98011 


2200 


Mission College Boulevard 


Santa Clara 


CA 


95052 


36 


Cooper Square 


New York 


NY 


10003 


2201 


C Street NW 


Washington 


DC 


20520 



In the same way, the point p in {X, Y} — > TZ such that px = 2 and py = 3 
should be described by 





X 


Y 


p 


2 


3 



instead oi p = (2,3), which would incorrectly imply 2 as index set. We could, 
of course, code the X-coordinate as and the y-coordinate as 1, so that it 
would be correct to write p as (2,3). The point is that we don't have to code 
familiar symbols such as X and Y for coordinates as something else for the sake 
of set-theoretic modeling. 

5.2 Typed tuples 

Consider a tuple with {number, street, city, state, zip} as index set, representing 
an address. We would like to ensure that the elements indexed by number 
and zip belong to Af and that the others belong to the set A* of alphabetic 
character strings. This is an example of typing a tuple. This involves in general 
associating a set with each index. That is, to type a tuple with index 7, we use 
a tuple of sets with index I. 

Exeimple 6 Addresses could be typed by the tuple of sets 



number 


street 


city 


state 


zip 


N 


A* 


A* 


A* 


N 



°The Extensible Markup Language, a standard promulgated by the World Wide Web 
Consortium. 
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Definition 10 (typing of tuples) Let I be a set of indexes, T a set of disjoint 
sets, T a tuple in I ^ T , and t a tuple in I ^ UT. We say that t is typed by 
T iff we have t{i) e r(i) for all i d I. 

Because the sets in T are disjoint, tliere is a function a G UT T tliat maps 
each element x of UT to the set that is an element of T to which x belongs. 
The condition of t being typed by t can be expressed by means of function 
composition; see Figure 0] 



Figure 4: The tuple t G / ^ UT is typed byrG/^Tiffr^crot. 



5.3 Cartesian products 

Definition 11 (Cartesian products) Let I he a set of indexes, T a set of 
disjoint sets, and r a tuple in I T . The Cartesian product on r, denoted 
CARt(t), is the set of all tuples that have r as type. 

Instead of " t is typed by r", we can now say " t G CART(r)". 

Example 7 Let r he in {numher, street, city, state, zip} — s- {J\f, A*}, More specif- 
ically, let T he equal to the tuple 



CART(r) is the set of all tuples of type r. In this example it means that 
any natural numher, however small or large, can occur as the element indexed 
hy number or zip, and that any sequence of characters, in any comhination, of 
any length, can occur as element indexed by street, city, or state. The set of 
actually existing addresses is a relatively small subset o/cart(t). 

Example 8 Let the index set {X, Y} contain the labels that distinguish the X 
and Y coordinates of the points in the Euclidean plane. Let t be the tuple of 
sets in {X,Y} — )■ {TZ}. Now CARt(t) is the Euclidean plane. 

Often the index set of r is n for some positive natural number n. For this 
case a special notation exists for CART(r). Let r be in n — > T, where T is a set 
of sets. The special nature of the index set allows us to write t as {tq, . . . , r„_i). 
In the same spirit, CARt(t) may be written as ro x • • • x t„_i. When T consists 



I 




number street city state zip 



M A* A* A* M 
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of a single set, say, T = {S}, then t E n ^ {S}, and tq x • • • x t„_i may be 
written as S* x • • • x S*, abbreviated as S''\ 

^ V ' 

n times 

Example 9 S x T = CART(r) where t e 2 ^ {S,T} with t(0) = S and 
t(1) = T. / trust that the context prevents confusion with the binary Cartesian 
product S X T defined as {{x,y) \ x € S A y T} (see Definition\^. 

Example 10 A pack of playing cards can be modeled as a Cartesian product. 
Let SUIT = {*, 0, 9, 4} and let VALUE = {2, 3, 4, 5, 6, 7, 8, 9, 10, J, Q, K, A}. Let 
T = (suit, value) . Then the 52 elements o/CART(r), which include e.g. (<v?, Q) , 
and (<|k,5), can be interpreted as playing cards. As the index set of t is 2, we 
can write CARt(t) as SUIT x value. 

Example 11 Let the index set L — n for some natural number n and let T = 
{??,}. Then L T consists of one typing tuple, say r. CART(r) consists of the 
points in Euclidean n-space and we write 7?." instead o/CARt(t). 

5.4 Functions and Cartesian products 

We have considered functions / in 5 T. These have one argument a; in S' 
that gets mapped into f{x) € T. 5 or T, or both, may be a Cartesian product. 

Suppose S — CARt(t), with t e L ^ U. The argument of / is a tuple 
that is not necessarily indexed by numbers. However, if / = n, then we have 
f G UqX- ■ - x Un-i T and /-values look like /((uq, . . . , m„_i)), usually written 
as /(uo, ■ • ■ , Mn-i), where uo G J7o, • ■ • , Un-i G Un-i- This is an alternative to 
Currying for modeling multi-argument functions. 

It can also happen that the target set T of a function / in S' — > T is a 
Cartesian product. In such a case it may be more natural to think of / as a 
tuple /' of functions. Suppose that / is in S' ^ CARt(t) with r e / ^ [/, where 
U is a, set of sets. Thus, for each x S S*, we have that f{x) is a tuple with index 
set /. For each i G /, {f{x))i is an element of this tuple. 

One can also think of such an / as a tuple /' of functions with index set 
I. For each i G I, we define the element of the tuple /' as the function in 
S ^ Ti with map x ^ {f(x))i. 

For every function / that has a Cartesian product as target, there is a tuple 
/' of functions uniquely determined as shown above. Conversely, for every tuple 
of functions that have the same source, the above shows the uniquely determined 
fimction with this source that has tuples as values. 

5.5 Projections 

Consider a sequence s — (a, b, c, d, e) in 5 — > A, where A is a set of alphabetical 
characters. Which, if any, of the following is a "subsequence" of s: {b,c,d), 
(a,c,e), {e,c,a), or {a,a,c}l Authors' opinions differ. 

We saw that the definition of function leads in a natural way to the notion 
of restriction. Sequences, being tuples, being functions, therefore also have 
restrictions. In set theory, subtuples are modeled as restrictions: 
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Definition 12 (subtuple) If S' C S, then f i S' is the subtiiple on S' of the 
tuple f in S ^ T . 



<a,c, e> 



■i sl{0 



\ 

1(0,2,4} 3 

<0,2,4> 



{0,2,4} 



UT 




Figure 5: Left: See Example IT^ Right: See Definition rPil The fact that t is 
typed by r is expressed by the fact that t = a ot. It follows that t oi = aotoi. 



Example 12 Let s = {a,b, c, d, e) . We could restrict the index set of s, which 
is 5, to S' = {0,2,4} and get as a subtuple of s the restriction s | S' , which 

and which is not {a,c,e). In fact, we have that {a,c,e) — {si 






2 


4 


a 

)o{ 


c 

3,2, 


e ' 
4), I, 



S')o (0,2,4), if the target of {0,2,4) is {0,2,4}. 5*66 Figure\^ 

Though tuples are functions, we saw that they come with their own termi- 
nology. We saw that it is "index set" instead of "source" and "subtuple" instead 
of restriction. In fact, there is additional specialized terminology for tuples. 

Definition 13 (projection) Let t be a tuple with index set I and let J be a 

subset of I . The projection 7rj(i) oft on J is the subtuple t I J oft. 

Thus we have 7r{o 2,4}((a, b, c, d, e)) 

When the projection is on a singleton set of indexes, the result is a singleton 

3 






2 


4 


a 


c 


e 



tuple. Rather than writing 7r{3}((a, b, c, d, e)) 
set of indexes as the index by itself: 7r3((a, b, c, d, e)) 



we may write the singleton 
3 

d ■ 



If a tuple t has type r, then the subtuples of t are typed in an obvious way. 
This observation suggests the definition below. 

Definition 14 (subtype) Let T be a set of disjoint sets, I an index set, J a 
subset of I, and t a type in I T . We define t I J to be the subtype of t 
determined by J . 

Let now t E I ^ UT be a tuple typed by t. It is easy to see that the subtuple 
i j J of t is typed by the subtype r | J of r. Observe that, if i G J ^ / is the 
insertion function, then t [ J = toi. In FigureElwe see that t [ J = a o {t [ J). 
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This shows that if we want to say that i J, J is typed by t J, J, then we can say 
it with arrows. 

A projection that is defined for an individual tuple is also defined, as canon- 
ical set extension, on a set of tuples that have the same type. Consider for 
example a Cartesian product. It is a set of tuples of the same type. Therefore 
7rj(CART(T)) is defined by canonical set extension to be equal to {7rj(t) | t G 
CARt(t)}. These are all the tuples of type ttj{t), so by definition equal to 
CART(7rj(T)). Thus we see that a projection of a Cartesian product is itself a 
Cartesian product. 

6 Relations 

In Section|21we defined binary relations. In this section we define a kind of rela- 
tion that can hold between any number of arguments. We call these "relation" . 
Whenever we refer to a binary relation, we should always qualify it as such. The 
generality of relations implies that some of them hold between two arguments. 
But these are still relations and not binary relations. The situation is similar to 
the trees in Knuth's Art of Programming jTj- He defines binary trees and trees 
in such a way that binary trees are not a special case of trees. 

Relations are specified by means of sets consisting of the tuples containing 
the objects that are in the relation. It is not enough for the sizes of these tuples 
to be the same. Just as the source and the target are parts of the specification 
of a function, a relation is most useful as a concept when its tuples have the 
same type. Therefore this type is part of the specification of a relation. 

Definition 15 (relation) A relation is a pair (r, E) where t, the signature, 
or the type, of the relation, is a tuple of type I ^ T , where T is a set of sets, 
and E, the extent of the relation, is a subset of CART (t). 

Example 13 (t, E) where r has the empty set as index set. CARt(t) has one 
element, the empty tuple. There are two possibilities for E: the empty set and 
the singleton set containing the empty tuple. 

The following example emphasizes the distinction between a relation and a 
binary relation. 

Example 14 A relation (r, E) with r G 2 ^ T can be represented without loss 
of information by the binary relation (t(0), r(l), {(tg, ii) | t G E}) . Conversely, 
let {S,T,E) be a binary relation and let t € 2 {S,T} be such that tq = S 
and Ti = T . This binary relation is represented without loss of information as 
the relation {t,F) where F = {t e CARt(t) | (to,ti) G E}. 

The following example shows that set theory is of potential interest to 
databases. 

Example 15 Let t be as in Example^ The actually existing addresses are 
but a relatively small subset E o/CART(r). The relation {t,E) is a relational 
format for the information represented by this set of addresses. 



22 



Example 16 The Euclidean plane is a set of tuples that have index set {X, Y} 
and where both elements are reals. That is, the Euclidean plane is CART(r), 
where r. 

A figure in the Euclidean plane is a subset of the Euclidean plane. So every 
figure in the Euclidean plane is a relation of type {X,Y} — > {TZ\. For example, 
the relation 

(r, {p G CARt(t) I pi- +pI^ 1}> 
is the unit circle with the origin as centre. 

6.1 Set-like operations 

Among relations of the same type certain relational operations are defined that 
mirror set operations: 

(t,Si)U(t,S2) {r,E,UE2) 
{t, El) n {t, E2) ''^ {T,EinE2) 
{t,Ei)\{t,E2) {t,Ei\E2) 

6.2 Relations with named tuples 

The extent of a relation (t, E) is the set of tuples E. These tuples are not named 
or ordered. Naming of the tuples can be achieved by a set S of names and a 
function f in S E. If we want to name all tuples, / needs to be surjective. 

It is also be possible to name the tuples of a relation by means of a subset 
/' of / (where / is the index set of r), if t | /' uniquely identifies t. That is, 
if to i = ^1 i I' implies, for all to,ti e E, that to = ti. Sometimes such an 
/' is a singleton set {i}. A common example is where E consists of tuples t 
describing employees and tt is the social insurance number. 

6.3 Projections and cylinders 

Recall that projections are defined (Definition [T^ as restrictions of tuples. 
Canonical set extensions of projections are therefore defined on sets of tuples. 
As extents of relations are sets of tuples, projections are also defined, as canon- 
ical set extensions, on extents of relations. 

Definition 16 (projection) Let t be in I T , where T is a set of disjoint 
sets and I is an index set. The projection on J of the relation (r, E) is written 
7rj((T, E)) and is defined to be the relation {t I J, {t I J \ t G E}). 

Example 17 If we have a relation r = (r, E) where r is 



number 


street 


city 


state 


zip 




A* 


A* 


A* 


AA 
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and E is the set of tuples that represent addresses of subscribers, then i: ^^.j^y g-^fj^i^^ir) 
is a relational format for the cities where there is at least one subscriber. 

Example 18 Let r be the relation {t,E) where t — 3 ^ {{a.b}} and E = 
{{a, a, a), {a,a,b), {b,a,b)}. 

An example of a projection is 

TTsW = (2^ {{a, 6}}, {(a, a), (6, a)}) 

We cannot represent the extent of e.g., 7r|o,2}('') with angle brackets because the 
angle brackets presuppose an index set in the form of n. Instead we say, 

m2}W = ({0,2}^{{a,6}},i?) 






2 


a 


a 


a 


b 


b 


b 



where E = 



Projections are more often used as canonical set extensions on sets of tu- 
ples than on individual tuples. As projections are typically not injective, they 
typically do not have an inverse. But every function does have an inverse set 
extension. The inverse set extensions of projections are especially useful. 

Let / be an index set with a subset J. If 5 is a set of tuples with index set 
J, then we can ask: What tuples t with index set / are such that Trj{t) G S? 
This is the definition of inverse set extension: it is defined for all functions, 
projections or not. So 

7rj\S)^{t\7rjit)eS}. 

Example 19 In Examvle MfA ^^Qij applied to the extent of Tr^Q ij{r) andn^^^y 
applied to the extent of 7r{o,2} o-fe, respectively, 

{{a,a,a), {a,a,b), {b,a,a), {b,a,b)} 

{{a, a, a), (a, b, a), (a, a, b), (a, 6, b), {b, a, b), (b, b, b)} 

Given a relation r, one can also wonder about relations that have r as pro- 
jection. There is a largest such, which is the "cylinder" on r. This gives the 
idea; to get the signatures right, see the following definition. 

Definition 17 (cylinder) Fori e {0, 1}, let Ti be in li — > Xi, such that tq and 
Ti are summable; Ti is a set of sets. Let {tq,Eq) be a relation. The cylinder in 
Iq U Ji on (tqjEq) is written as tt^Jj^^ ((tq, i?o)) '^'^'^ '-s defined to be relation 

{to + Ti, {t e cart(to + Ti)\t llo^ Eo}). 

The notation tt"-^ suggests some kind of inverse of projection. The suggestion 
is inspired by facts such as tt/q (tt^Jjj^ ((tq, i?o))) = {tq^Eq). 



24 



Example 20 Consider CART(r), with t in {X, Y, Z} {T^}, cmd think of it as 
the three-dimensional Euclidean space. Let the relation r be (r, {t £ CARt(t) | 
tx+ty + t'z — 1})' which is the unit sphere with centre in the origin. The 
projections 

T{x,y} W = {r i {X, r}, {t e tt^x,y}CAKt{t) \t\+tl< 1}) 
T^{z.Y}{r) = (r i{Z,r},{te^{2^y}CART(r) 14+4 < 1}) 
7r{z,x}W = (ri{Z,X},{ie^{z,x}CART(T) |4+4<1}> 

can he thought of as the two-dimensional projections of the sphere. 
The one- dimensional projections 

^x{r) = (r i {X},{t £ 7rxCART(T) | 4 < 1}) 
^y(r-) = (r [ {F}, {t e 7ryCART(T) | 4 < 1}) 
^z(r) - (r i {Z}, {t e 7rzCART(r) | 4 < 1}) 

can be thought of as the one- dimensional projections of the sphere. 

The cylinder i:^^ {■Kx{r)) looks like an infinite slab bounded by two planes 
parallel to the Y, Z -plane through the points with Y and Z coordinates and 
with X coordinates —1 and +1. This slab is just wide enough to contain the 
sphere. 

TT^^TTxW) n TTy^TTYir)) H TTz^{TTz{r)) 

is the intersection of three such slabs perpendicular to each other, so it is the 
smallest cube with edges parallel to the coordinate axes that contains the sphere. 
It is a Cartesian product. 

The cylinder TT^x Y}i''^{x,Y}{f)) looks like, well, a cylinder. Its axis is par- 
allel to the Z axis. The unit disk in the X, Y -plane with its centre at the origin 
is a cross-section. 

\x,Y}i'^{x,Y}{'r)) n T^{z,x}'y'^{z.x}{r)) H \z,Y}i'^{z,Y}{r)) 

is the intersection of three such cylinders. It is a body bounded by curved planes 
that properly contains the sphere r and is properly contained in the box. 

Definition 18 (join) For i G {0,1} let there be relations {Ti,Ei), with Ti £ 
li Ti and Ti a set of disjoint sets. If ro and ri are summable, then the 
join of {tq,Eq) and {ti,Ei) is written as {tq,Eq) 1X1 {ti,Ei) and defined to be 
'^/"ou/i ((^0, i?o)) n TTf^u,^ ((ri, Si)). 

The intersection in this definition is defined because of the assumed summa- 
bihty of To and ti. The signature of (tq, Eq) M {ti,Ei) is tq + ti. 

7 Application to relational databases 

Codd [3] proposed to represent tlic information in "large banks of format- 
ted data" as a collection of relations. This proposal has been so successful 
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that databases are ubiquitous and that most of these conform to Codd's rela- 
tional model for data. The success of the relational model is due to the fact 
that its mathematical nature has made more manageable the complexity that 
would have prevented the earlier models to support the enormous growth that 
databases have experienced. 

I have selected relational databases as example of an application in computer 
science where elementary set theory is useful. This is because the notion of 
relation is far from clear in the early database literature. As examples I have 
selected Codd's original paper ^ and UUman's widely quoted textbook [TT|. 

7.1 The relational model according to Codd 

In Codd ^ we find the following definition, the original one, in section 1.3 "A 
Relational View of Data" : 

The term relation is used here in its accepted mathematical sense. 
Given sets Si, . . . ,S'„ (not necessarily distinct), i? is a relation on 
these n sets if it is a set of n-tuples each of which has its first element 
from ^i, its second element from 5*2, and so on. We shall refer to Sj 
as the jih domain of R. 

Note that Sj, not j, is the domain. Thus Si and Sj may be the same set, 
even though i ^ j. The above quote continues with: 

For expository reasons, we shall frequently make use of an array 
representation of relations, but it must be remembered that this 
particular representation is not an essential part of the relational 
view being expounded. An array which represents an n-ary relation 
R has the following properties: 

1. Each row represents an n-tuple of R. 

2. The ordering of rows is immaterial. 

3. All rows are distinct. 

4. The ordering of columns is significant — it corresponds to 
the ordering S\, . . . ,Sn of the domains on which R is defined 
(see, however, remarks below on domain-ordered and domain- 
unordered relations). 

5. The significance of each column is partially conveyed by label- 
ing it with the name of the corresponding domain. 

Codd gives as example of such an array the one shown in Figure El He 
observes that this example does not illustrate why the order of the columns 
matters. For that he introduces the one in Figure [7| 

He explains Figure [T] as follows. 
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supply (supplier part project quantity) 
12 5 17 

Figure 6: Codd's first example: domains all different. 



component (part part quantity) 
1 5 9 

Figure 7: Codd's second example: domains not all different. 



. . . two columns may have identical headings (indicating identical 
domains) but possess distinct meanings with respect to the relation. 

We can take it that in Figure H we have n = 3, Si =82 — part, and 
5*3 — quantity. As Si,...,Sn need not all be different, columns can only 
identified by {1, . . . , n}. 

Codd goes on to point out that in practice n can be as large as thirty and 
that users of such a relation find it difficult to refer to each column by the 
correct choice among the integers 1, . . . ,30. According to Codd, the solution is 
as follows. 

Accordingly, we propose that users deal, not with relations, which 
are domain-ordered, but with relationships, which are domain-unordered. 

One problem is the term "domain-ordered" . The term suggests that the 
relation is ordered by the domains Si, ... , Sn. But, as Codd warns us, these 
sets are necessarily distinct. If there are fewer than n domains, then they cannot 
order the relation. 

Another problem with this passage is the introduction of "relationships" as 
distinct from "relations" . Codd notes in his description of the relational view 
of data that "the term relation is used in its accepted mathematical sense" . He 
would have had a hard time to find an accepted mathematical sense for the 
distinction between "relation" and "relationship" . 

Codd started out with the bold idea that the data that need to be man- 
aged in practice can be organized as relations "in their accepted mathematical 
sense" . Within one page he was forced to retract from this promising start 
to get bogged down in the murky area of "domain-ordered relations" versus 
"domain-unordered relationships" . 

The cause of the difficulty is that Codd seemed to regard it as somehow 
unmathematical for the index set / to be anything that differs from {1, . . . , n}. 
In this paper, the exposition leading up to Definition ^] is intended to ensure 
that one is not even tempted to entertain such a misconception. 
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7.2 The relational model according to Ullman 

The reader may have thought that the difficulties in Codd |S| would soon be 
straightened out as this original proposal became mainstream computer science, 
and, as such, the subject of widely quoted textbooks. Let us look at one of these, 
the one by J.D. Ullman 

[TT] introduces "The Set-Theoretic Notion of a Relation" (page 43), a notion 
there also called "the set-of-lists" notion of a relation. This is distinguished from 
"An Alternative Formulation of Relations" , one that is called "relation in the 
set-of-mappings sense" . 

The set-list-lists notion of a relation is any subset of the Cartesian product 
Di X • • • X Dk, where Di,. .. ,Dk are domains. While Codd was careful to 
say that the domains need not be distinct, says that there are k domains. 
But this is probably not intended. uses attributes Ai, . . . , to name the 
columns of a tabular representation of a relation. does not say whether 
these are distinct, but that is probably intended. 

In the set-of-lists type of relation, columns are named by the indexes 1, . . . ,k. 
Apparently, the attributes Ai, . . . , Ak are redundant comments on the columns. 

Starting on page 44, in the "Alternative Formulation" section, JT] makes 
the observation that the attributes can be used to index the domains instead 
of the indexes {1, . . . , fc} in the set-of-lists type of relation. When the indexes 
are attributes, one has a "relation in the set-of-mappings sense" . This kind of 
relation is illustrated with examples rather than defined. 

[TT] observes that in the practice of database use, relations are sets of map- 
pings. This make one wonder why the other kind, with its redundant numerical 
indexes, was introduced. The answer comes on page 53, in the section on rela- 
tional algebra. 

Recall that a relation is a set of fc-tuples for some k, called the arity 
of the relation. In general, we give names (attributes) to the compo- 
nents of tuples, although some of the operations mentioned below, 
such as union, difference, product, and intersection, do not depend 
on the names of the components. These operations do depend on 
there being a fixed, agreed-upon order for the attributes; i.e. they 
are operations on the list style of tuples rather than the mapping 
(from the attribute names to values) style. 

As we saw earlier, these operations do not depend on the index set of the 
type T of a relation (r, E) having "a fixed, agreed-upon order" for its elements. 

Definition ^| obviates the need for Codd's distinction between "domain- 
ordered" and "domain-unordered" relations and for UUman's distinction be- 
tween "sets-of-lists type relations" and "relations in the sets-of-mappings sense" . 
Codd's idea of a relational format for data was a promising one, but the special 
case of relations as subsets of Di x • • • x Z?„ is too special. It is perhaps not too 
late to revisit Codd's idea with relations according to Definition 1151 
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7.3 A reconstruction of the relational model according to 
set theory and logic 

Let us now see what the relational model for data would look like to someone 
who knows some set theory and who was only told the general idea of 3 without 
the details as worked out by Codd. We first look how data are stored, then how 
they are queried. Finally, in this section, we introduce a relational operation 
that facilitates querying. 

7.3.1 Relations are repository for data 

We have to start with what is implicit in the very idea of a database, relational 
or not. The information to be stored in a database concerns various aspects of 
things like employees in an organization, parts of an airplane, books in library, 
and so on. The general pattern seems to be that a database describes a collection 
of objects. 

There is no limit to the information one can collect about an object as it 
exists in reality. Hence one performs an act of abstraction by deciding on a 
set of attributes that apply to the object and one determines what is the value 
of each attribute for this particular object. A consequence of this abstraction 
is that it cannot distinguish between objects for which the attributes have the 
same value. The set of attributes has to be comprehensive enough that this does 
not matter for the purpose of the database. That is, within the microcosm of 
the database, one assumes that Leibniz's principle of Identity of Indiscernibles 
holds. 

The foregoing is summarized in the first of the following points. The remain- 
ing points constitute a reconstruction of the main ideas of a relational database 
as suggested by the first point. 

1. A database is a description of a world populated by objects. For each 
object, the database lists the values of the applicable attributes. 

2. The database presupposes a set of attributes, and for each attribute, a 
set of allowable values for this attribute. Such sets of admissible values 
are called domains. Let A be the set of attributes and let T be a set of 
disjoint domains. As each attribute has a uniquely determined domain, 
this information is expressed by a function, say t, that is of type A T . 

3. Not all attributes apply to every object. Hence, two objects may be similar 
in the sense that the same set of attributes applies to both. Let us say 
that objects that are similar in this sense belong to the same class. Hence 
a class is characterized by the subset of A that contains the attributes of 
the objects in the class. 

4. The description of each object is an association of a value with each at- 
tribute that is applicable to the object. That is, the object is represented 
by a tuple that is typed by r | /, where / is the set of attributes of the 
class to which the object belongs. 
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SUPPLIERS = 
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Figure 8: The relations for Example 12 II 



5. The objects of a class are described by tuples of the same type. As tuples 
of the same type are a relation, the class is a relation of that type. As 
objects need not belong to the same class, a database consists in general 
of multiple relations (r | /q, Eq), . . . ,{t I where /q, • • ■ , In-i 
are the attribute sets of the classes. 

6. The life cycle of a database includes a design phase followed by a usage 
phase. In the design phase A, T, and t G A ^ T is determined, as well as 
the subsets Iq, . . . , of A. This is the database scheme. In the usage 
phase the extents Eq, . . . , £"„_! are added and modified. With the extents 
added, we have a database instance. 

Note that the set of attributes of one class might be included in another. 
This suggests a hierarchy of classes. Note also that nothing is said about the 
nature of the values that the attributes might take. Values might be restricted 
to simple values like numbers or strings. Or they could be tuples of one of the 
relations. 

It would seem that whether to include these possibilities in a relational 
database would be a matter of trading off flexibility in modeling against sim- 
plicity and efficiency in implementation. Not so: these possibilities amount 
to an object-relational database, the desirability of which is subject to heated 
controversy. 

7.3.2 Queries, relational algebras, query languages 

According the relational model, relations are not only used as format for the 
data stored in a database, but also as format for queries; that is, for selections 
of data to be retrieved from the database. Thus the results of queries are 
relations. These relations depend on those that are stored in the database and 
must therefore be the result of operations on them. Let us see how we can use 
the operations introduced so far. 
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SELECT PNAME, CITY 

FROM SUPPLIERS, PARTS, PROJECTS 

WHERE PARTS. PID = PROJECTS.PID AND SUPPLIERS . SID = PARTS. SID 
AND RQTY <= PQTY 

Figure 9: An SQL query to the relations in Figure|H| 

Example 21 (Shim in Taos) Consider the relations shown in the tables in 
Figure Each table consists of a line of headings followed by the line entries 
of the tables. The line entries represent the tuples of the relations. Each table 
has three such lines. 

The following abbreviations are used. In the SUPPLIERS table, SID for sup- 
plier ID, SNAME for supplier name, and CITY for supplier city. In the PARTS 
table, PID for part ID, PNAME for part name, and PQTY for part quantity on 
hand. In the projects table, rid for project ID and rqty for part quantity 
required. 

We want to know the part names and cities in which there is a supplier with 
a sufficient quantity on hand for at least one of the projects. 

The relation specified by the SQL query in Figure El can be specified with 
relations according to Definitions El and with projection and join according to 
Definitions ^1 and ^1 respectively. 

Let the tables in Figure (HI be the relations suppliers = (tq, Eq), parts = 
{ti,Ei), and PROJECTS = (t2, i?2)- In addition, there is a relation in the query 
for which there is no table, namely the less-than-or-equal relation. Mathemati- 
cally, there is no reason to treat it differently from the relations stored in tables. 
Hence, we also include it as leq = {t^, E^). 

The index sets of tq, ti, T2 are the sets of the column headings of the 
tables for suppliers, parts, and projects, respectively: for tq the index 
set is { SID, SNAME, CITy}, for Ti it is { pid, pname, SID, PQTy}, for T2 it is 
{ rid, pid, rqty}. For T3 it is { rqty, pqty}. 

The extents E^, Ei, and E2 are as described in Figure [HI Moreover, E^ = 

{t £ CART(r3) I t RQTY < t pqty}- 

Consider now the relation 

'''{ PNAME, CITY} = SUPPLIERS M PARTS N PROJECTS M LEQ. (2) 

For the relations to be joinable, the signatures tq, ti, T2, and have to be 
summable. That is, for any elements common to their source sets, they have 
to have the same value. For example, the source sets of tq and ri have SiD 
in common. They both map SiD to its domain, which is the set of supplier 
IDs. Therefore tq and ri are summable; hence cities Xl parts is defined 
(Definition I18|) . Similarly with the other joins in the expression which has 
as value the relation described by the SQL query in Figure |H1 

In the query in Example 1211 every relation occurs at most once. In the 
following example we consider a query where this is not the case. 
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PARENT 



CHILD 



MARY 



JOHN 



PC 



JOHN 



ALAN 



MARY 



JOAN 



Figure 10 



Relation for Example 1221 



Example 22 (Mary, Alan) In Fiaure \T^ we show a table specifying a relation 
consisting of tuples of two elements where one is a parent of the other. It is 
required to identify pairs of persons who are in the grandparent relation. 

What distinguishes this query from the one in Example 1211 is that the rela- 
tions do not occur in the join as given, but are derived from the given relation. 
On the basis of the derived relations we create one in which the pairs are in the 
grandparent relation. In one of the SQL dialects this would be: 

SELECT PCO. PARENT, PCI. CHILD 
FROM PC AS PCO, PC AS PCI 
WHERE PCO. CHILD = PCI. PARENT 

In this query the derived tables are obtained via the linguistic device of renam- 
ing PC to PCo and PCi. 

7.3.3 An additional relational operation 

Do we have to introduce a special-purpose language to handle a simple query 
such as this one? In the following we show how an additional relational op- 
eration, "filtering" , is sufficient to handle, in combination with projection and 
join, not only this query, but more generally, to write relations expressions that 
mimic the queries of a powerful query language such as Datalog 0. 

The filtering operation acts on a relation and a tuple and results in a relation. 
Let the relation be (t, E) with t E I ^ T and T a set of disjoint sets. Let the 
tuple be p e / ^ y , where F is a set of objects that we think of as placeholders. 
In similar situations such placeholders are often called "variables" , which is fine, 
as long as we remember that they are elements of a set, and that there is nothing 
linguistic about them. 

For each such tuple p we can ask: is there a t d E such that t — s o p for 
some s E V UT? In that case we include s in the extent E' of a relation 
{(p,E'). See Figure [m 

Definition 19 (filtering) Let t be in I ~* T , where T is a set of disjoint sets 
and I is an index set. Let p I V be a tuple. Let tp € V T be such that 
T = ip o p. The filtering by p of the relation (r, E) is written (r, E) : p and is 
defined to be the relation {ip, {s G p{I) UT \ 3t Cz E. t ^ s o p}) . 

The condition t = Lpop ensures that the variables in V are typed compatibly 
with T. 
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Figure 11: Sets and functions involved in filtering. 

Example 23 Let I = 3, T = {7^}, V {x,y,z}, t e I ^ T, and E ^ 
{(u, V, w) g Ti? \ u X. V = w}. Let p ~ (x, x, z) . What is (r, E) : p? 

In this example we use the filtering operation to obtain the squaring relation 
as a special case of the multiplication relation. As T is a singleton set, there is 
only one possibility for r, and this is {Ti,Tl,TZ). The same holds for (p. In this 
way, (r, E) is the multiplication relation over the reals. 

Definition El now implies that the extent of (r, E) : p, the result of applying 
filtering with p to (r, E), is {s S {x, z} ^ TZ \ = Sz}- This can be explained 
as follows. We need to find all s € {x, z} ^ TZ such that there exists a. t G E 
with t — s o p. Now, t — s o p implies that Iq — {s o p)q — — s^- Similarly, 

tl = (S Op)i = = s^. 

That is, i = s o p has a solution s iff io = ^i- Indeed, p ~ {x,x,z) can be 
regarded as a pattern that a i G _B may or may not conform to. By filtering the 
multiplication relation's (r, E) tuples through pattern p, (r, E) : p becomes, in 
its way, the squaring relation. 

Let us now return to Example |52 to see how filtering can be used here. 
Suppose we have a set V = {x,y,z} and an index set / = {parent, child}. 
Let p and q both be in / ^ V, with p such that pparent = x, Pchild = 
and with q such that ^parent = U, 9CHILD = z. 

The relation 

T^{x,y}iPC ■■ p X PC:q) 

is the equivalent of the relation resulting from the SQL query. 

In this example, we have followed database usage in making the index set / = 
{parent, child} non-numerical. If we set / = 2, then we have the convenient 
notation {x, y) for p and (y, z) for q. The query then becomes 

T^{x^z}{PG : {x,y) M PC : {y,z)). (3) 

With the relational operations defined so far (the set-like operations, projec- 
tion, cylindrification, join, and filtering) we can define a wide variety of queries. 
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We do this by writing expressions in the informal language of set theory. This 
does not mean that a "query language" exists. Of course, to make such ex- 
pressions readable for a machine, they have to be formalized. Only then a 
"language" exists, and then only in a technical sense. Thus a "query language" 
should only arise as part of the user manual of a software package. It has no 
place in expositions of the relational data model. 

Similarly, as soon as we have relations and operations on them, an algebra 
exists. But this is only so in a technical and not in any substantial sense. Here 
is an example of what I mean by the existence of an algebra in a substantial 
sense. For example, we could observe that the set of integers is closed under 
addition, that addition has an inverse, and that zero is the neutral element 
under addition. That we then have an algebra in a substantial sense is borne 
out by the fact that this algebra is a group and that examples of groups exist 
that don't look like the integers at all, yet have certain interesting properties in 
common that are expressed in the usual group axioms and theorems. 

It happens to be the case that the set-like operations together with cylin- 
drification and certain special relations like the diagonals constitute an algebra 
worth the name, the cylindric set algebra. The operations were identified by 
Tarski. He only talked about an algebra when a significant theorem and in- 
teresting properties had been identified, and then only in abstracts less than a 
half page long . Only much later, when significant algebraic results were 
obtained, were cylindric algebras made the subject of a longer publication 

In the database world things work differently: already in the first few years, 
Codd proposed two query languages for the relational data model. One, the 
relational calculus, was to be of a nonprocedural nature, so as to make it easy for 
users to relate the query to their intuitive perception of the real- world situation 
described by the database. The other query language, the relational algebra, 
is distinguished by operations on relations as algebraic objects. This language 
was intended to facilitate query optimization. However, for some decades the 
query language used most widely in practice is SQL, which has neither of these 
characteristics. 

I have avoided introducing a relational algebra or a query language. Instead 
I have limited myself to introducing operations on relations: the set-like op- 
erations, projection, cylindrification, join, and filtering. The closest I come to 
a query language are the expressions in the informal language customary and 
universal to all mathematical discussions. 

In spite of these limitations, it is striking how close a query such as the one in 
Equation Q comes to its equivalent in Datalog j^, one of the query languages 
proposed in the literature. In Datalog, this query would be 

ANSWER(a;, z) <— PC{x, y), PC(y, z). (4) 

One of the advantages of Datalog over the relational calculus of Codd is that 
Datalog is a subset of the first-order predicate calculus, a relational calculus that 
antedates computers by half a century. The predicate calculus fully embodies 
Codd's ideal of a declarative relational language. The reason Codd could not 
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adopt it, is that a relational algebra counterpart was not known at the time. 
Recent research [SJ ^] has remedied this deficiency. 

8 Conclusion 

When I advocate basic set theory for computer science, I don't mean finding the 
right formula to copy. Neither this paper, nor any of the books may have the 
right formula. For example, neither Bourbaki ^ nor Halmos 0] give exactly 
the notion of relation required for databases. The difference is mathematically 
trivial, but crucial for the relational data model. Neither [T], nor [J, nor this 
paper can anticipate such trivial but crucial variations. 

For the mathematicians consulted by Codd, the distinction between n = 
{0, . . . , 71 — 1} and more general index sets in a relation was trivial, so they 
used the most familiar, which is n. But the failure to allow for general index 
sets has continued to trouble the relational data model for years. For Codd it 
was dangerous to know more than the general idea that a relation is a set of 
tuples, that a tuple is a function, and that arguments for a function don't have 
to be numbers. And for us it is dangerous to copy the formulas from Codd. We 
should remember his idea, and take it from there, as best we can. 

Hence the advice given by Halmos ^ "Read this, and forget it." Or as 
Goethe said: "Was du ererbt von deinen Vatern hast, erwirb es um es zu be- 
sitzen."^ 

In other words, don't copy the formulas. 
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